An Efficient Method for Mining Frequent Weighted Closed Itemsets from Weighted Item Transaction Databases

نویسنده

  • Bay Vo
چکیده

1 Division of Data Science, Ton Duc Thang University, Ho Chi Minh, Viet Nam 4 2 Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh, Viet Nam 5 [email protected], [email protected] 6 7 Abstract: In this paper, a method for mining frequent weighed closed itemsets (FWCIs) 8 from weighted item transaction databases is proposed. The motivation for FWCIs is that 9 frequent weighted itemset mining, as frequent itemset (FI) mining, typically results in a 10 substantial number of rules, which hinders simple interpretation or comprehension. 11 Furthermore, in many applications, the generated rule set often contains many redundant 12 rules. The inspiration for FWCIs is that one potential solution to the rule interpretation 13 problem is to adopt frequent closed itemset. This study first proposes two theorems and a 14 corollary. One theorem is used for checking non-closed itemsets while joining two 15 itemsets to create a new itemset and the other theorem is used for checking whether a new 16 itemset is non-closed itemset or not. The corollary is used for checking non-closed 17 itemsets when using Diffsets. Based on these theorems and corollary, an algorithm for 18 mining FWCIs is proposed. Finally, a Diffset-based strategy for the efficient computation 19 of the weighted supports of itemsets is described. A complete evaluation of the proposed 20 algorithm is presented. 21

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new method for mining Frequent Weighted Itemsets based on WIT-trees

The mining frequent itemsets plays an important role in the mining of association rules. Frequent itemsets are typically mined from binary databases where each item in a transaction may have a different significance. Mining Frequent Weighed Itemsets (FWI) from weighted items transaction databases addresses this issue. This paper therefore proposes algorithms for the fast mining of FWI from weig...

متن کامل

Weighted Support Association Rule Mining using Closed Itemset Lattices in Parallel

In this paper, we propose a new algorithm which associates weight to each item in the transaction database based on the significance of the corresponding item. Weighted support is calculated using the weight and the frequency of occurrence of the item in the transactions. This weighted support is used to find the frequent itemsets. We partition the database among ‘N’ processors and generate clo...

متن کامل

Mining High Utility Itemsets from Large Transactions using Efficient Tree Structure

Mining high utility itemsets from a transactional database refers to the discovery of itemsets with high utility like profits. It is an extension of the frequent pattern mining. Although a number of relevant algorithms have been proposed in recent years, they incur the problem of producing a large number of candidate itemsets for high utility itemsets. Such a large number of candidate itemsets ...

متن کامل

Simultaneous mining of frequent closed itemsets and their generators: Foundation and algorithm

Closed itemsets and their generators play an important role in frequent itemset and association rule mining. They allow a lossless representation of all frequent itemsets and association rules and facilitate mining. Some recent approaches discover frequent closed itemsets and generators separately. The Close algorithm mines them simultaneously but it needs to scan the database many times. Based...

متن کامل

An Efficient Algorithm for Mining Weighted Frequent Itemsets Using Adaptive Weights

Weighted frequent itemset mining is more practical than traditional frequent itemset mining, because it can consider different semantic significance (weight) of items. Many models and algorithms for mining weighted frequent itemsets have been proposed. These models assume that each item has a fixed weight. But in real world scenarios, the weight (price or significance) of the items may vary wit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Inf. Sci. Eng.

دوره 33  شماره 

صفحات  -

تاریخ انتشار 2017